A general framework for reinforcement learning

نویسنده

  • Csaba Szepesvari
چکیده

In this artide we p1'Opose a geneml framework for sequential dedsion making. The fi'amework is baSf.d on the observation that the. derication of the optimal behaviour ttnde.T various decision criteria follows the. same patte!"n: the cost of policies can be decomposed into the successive applicatlOn of an opemtor that defines the related dynamic programming algordhm and this ope.mtor descnbes complete.ly the structure of the decision problem. vVe take this mapping (the 80 called one step lookahead (OLA) cost mapping) as our starting point. This enables Ihe unified Irealmenl of various decision ,Tile ria (e.g. Ihe expected value (Tilerion OJ' Ihe worsl-case 'Tile1'ion). The main resuli 0./ Ihis arlicle says Ihal Wldt1" minimal condilions oplimal slalionm'Y policies are greedy "W.l·.I. Ihe oplimal cosl Junelion and vice versa. Based on Ihis n:suli "We ./eel Ihal former' resttlis on r'emforcemenl learning can be Imnsfel"1"td 10 olhel' decision (Tileria pl'ovided Ihal lhe decision criltTion is decomposable by an apPl"Opj'iale mapping.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reinforcement Learning in Neural Networks: A Survey

In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...

متن کامل

Reinforcement Learning in Neural Networks: A Survey

In recent years, researches on reinforcement learning (RL) have focused on bridging the gap between adaptive optimal control and bio-inspired learning techniques. Neural network reinforcement learning (NNRL) is among the most popular algorithms in the RL framework. The advantage of using neural networks enables the RL to search for optimal policies more efficiently in several real-life applicat...

متن کامل

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...

متن کامل

A .net Reinforcement Learning Platform for Multiagent Systems

Reinforcement learning is a convenient way of allowing the agents to autonomously explore and learn the best action sequences that maximize their overall value, based on successive rewards received from the environment. Among other similar libraries and platforms, the reinforcement platform presented here is especially designed to be used with the .NET framework and provides a general support f...

متن کامل

Multiagent Reinforcement Learning in Stochastic Games

We adopt stochastic games as a general framework for dynamic noncooperative systems. This framework provides a way of describing the dynamic interactions of agents in terms of individuals' Markov decision processes. By studying this framework, we go beyond the common practice in the study of learning in games, which primarily focus on repeated games or extensive-form games. For stochastic games...

متن کامل

Deep Reinforcement Learning as Foundation for Artificial General Intelligence

Deep machine learning and reinforcement learning are two complementing fields within the study of intelligent systems. When combined, it is argued that they offer a promising path for achieving artificial general intelligence (AGI). This chapter outlines the concepts facilitating such merger of technologies and motivates a framework for building scalable intelligent machines. The prospect of ut...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995